INCLUSive: INtegrated Clustering, Upstream sequence retrieval and motif Sampling
Abstract
INCLUSive allows automatic multistep analysis of microarray data (clustering and motif finding). The clustering algorithm (adaptive quality-based clustering) groups together genes with highly similar expression profiles. The upstream sequences of the genes belonging to a cluster are automatically retrieved from GenBank and can be fed directly into Motif Sampler, a Gibbs sampling algorithm that retrieves statistically over-represented motifs in sets of sequences, in this case upstream regions of co-expressed genes.
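The motif-finding step can be illustrated with a minimal sketch of a classic Gibbs site sampler. This is not the Motif Sampler implementation: it assumes exactly one motif occurrence per sequence, a zero-order background estimated from the input itself, and sequences containing only A, C, G and T. The function names (`build_pwm`, `background`, `gibbs_motif_sampler`) and all parameters are hypothetical, chosen for illustration.

```python
import random
from collections import Counter

ALPHABET = "ACGT"  # assumption: input contains only these four bases


def build_pwm(sequences, positions, width, skip, pseudocount=1.0):
    """Per-column base probabilities from the current sites, leaving out sequence `skip`."""
    counts = [Counter() for _ in range(width)]
    for i, (seq, pos) in enumerate(zip(sequences, positions)):
        if i == skip:
            continue
        for j in range(width):
            counts[j][seq[pos + j]] += 1
    total = (len(sequences) - 1) + pseudocount * len(ALPHABET)
    return [{b: (col[b] + pseudocount) / total for b in ALPHABET} for col in counts]


def background(sequences, pseudocount=1.0):
    """Zero-order background: overall base frequencies of the input (a simplification)."""
    counts = Counter("".join(sequences))
    total = sum(counts[b] for b in ALPHABET) + pseudocount * len(ALPHABET)
    return {b: (counts[b] + pseudocount) / total for b in ALPHABET}


def gibbs_motif_sampler(sequences, width, iterations=200, seed=0):
    """Return one sampled motif start position per sequence."""
    rng = random.Random(seed)
    bg = background(sequences)
    # Start from random site positions.
    positions = [rng.randrange(len(s) - width + 1) for s in sequences]
    for _ in range(iterations):
        for i, seq in enumerate(sequences):
            # Motif model from all other sequences.
            pwm = build_pwm(sequences, positions, width, skip=i)
            # Score every candidate start by its motif/background likelihood ratio.
            weights = []
            for start in range(len(seq) - width + 1):
                ratio = 1.0
                for j in range(width):
                    base = seq[start + j]
                    ratio *= pwm[j][base] / bg[base]
                weights.append(ratio)
            # Sample the new site in proportion to the ratios (not an argmax).
            positions[i] = rng.choices(range(len(weights)), weights=weights)[0]
    return positions
```

Calling `gibbs_motif_sampler(upstream_seqs, width=8)` on the upstream regions of one cluster would return a candidate site per sequence. Because the procedure is stochastic, samplers of this kind are usually run several times and the results compared.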
Similar resources
Integrating quality-based clustering of microarray data with Gibbs sampling for the discovery of regulatory motifs
In microarray experiments, genes exhibiting a similar expression profile are potentially coregulated. Clustering identifies such groups of coexpressed genes, whose upstream regions can then be searched for putative regulatory elements. We present two algorithms and an interactive web-based user interface that integrate cluster analysis and motif finding for the analysis of microarray data. Startin...
INCLUSive: a web portal and service registry for microarray and regulatory sequence analysis
INCLUSive is a suite of algorithms and tools for the analysis of gene expression data and the discovery of cis-regulatory sequence elements. The tools allow normalization, filtering and clustering of microarray data, functional scoring of gene clusters, sequence retrieval, and detection of known and unknown regulatory elements using probabilistic sequence models and Gibbs sampling. All tools ar...
BioProspector: Discovering Conserved DNA Motifs in Upstream Regulatory Regions of Co-Expressed Genes
The development of genome sequencing and DNA microarray analysis of gene expression gives rise to the demand for data-mining tools. BioProspector, a C program using a Gibbs sampling strategy, examines the upstream region of genes in the same gene expression pattern group and looks for regulatory sequence motifs. BioProspector uses zero to third-order Markov background models whose parameters ar...
CSE 527 Lecture 4, 10/08/03
Notes by Ana Kristine Torgerson (atorgers@u). Case study, continued. Sporulation summary. What they did: measured mRNA expression levels of all 6200 yeast genes at 7 time points in a (loosely synchronized) sporulating yeast culture, plus some more standard tests and controls. What they learned: a 3-10X increase in the number of genes implicated in various subprocesses. Several subsequently verified by direct...
Evolutionary Monte Carlo Methods for Clustering
The problem of clustering a group of observations according to some objective function (e.g., K-means clustering, variable selection) or a density (e.g., posterior from a Dirichlet process mixture model prior) can be cast in the framework of Monte Carlo sampling for cluster indicators. We propose a new method called the evolutionary Monte Carlo clustering (EMCC) algorithm, in which three new “...
Journal: Bioinformatics
Volume 18, Issue 2
Pages: -
Publication date: 2002